Dataset statistics
| Number of variables | 21 |
|---|---|
| Number of observations | 100000 |
| Missing cells | 1752 |
| Missing cells (%) | 0.1% |
| Duplicate rows | 22 |
| Duplicate rows (%) | < 0.1% |
| Total size in memory | 16.8 MiB |
| Average record size in memory | 176.0 B |
Variable types
| Numeric | 10 |
|---|---|
| DateTime | 1 |
| Text | 6 |
| Categorical | 4 |
| Dataset has 22 (< 0.1%) duplicate rows | Duplicates |
price is highly overall correlated with cost | High correlation |
cost is highly overall correlated with price | High correlation |
nal is highly overall correlated with electron | High correlation |
electron is highly overall correlated with nal | High correlation |
Тип города is highly imbalanced (71.4%) | Imbalance |
Тип улицы is highly imbalanced (69.3%) | Imbalance |
Тип номера дома is highly imbalanced (70.1%) | Imbalance |
amount is highly skewed (γ1 = 122.4826947) | Skewed |
price is highly skewed (γ1 = 201.5919024) | Skewed |
cost is highly skewed (γ1 = 201.4310729) | Skewed |
nal is highly skewed (γ1 = 35.09836797) | Skewed |
electron is highly skewed (γ1 = 213.3822051) | Skewed |
avans is highly skewed (γ1 = 270.6520885) | Skewed |
credit is highly skewed (γ1 = 134.3508037) | Skewed |
vstrechpredst is highly skewed (γ1 = 185.7721841) | Skewed |
nal has 23409 (23.4%) zeros | Zeros |
electron has 76671 (76.7%) zeros | Zeros |
avans has 99924 (99.9%) zeros | Zeros |
credit has 99961 (> 99.9%) zeros | Zeros |
vstrechpredst has 99976 (> 99.9%) zeros | Zeros |
Reproduction
| Analysis started | 2023-08-21 16:02:25.230223 |
|---|---|
| Analysis finished | 2023-08-21 16:03:25.290384 |
| Duration | 1 minute and 0.06 seconds |
| Software version | ydata-profiling vv4.5.1 |
| Download configuration | config.json |
receiptid
Real number (ℝ)
| Distinct | 99660 |
|---|---|
| Distinct (%) | 99.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 70755830 |
| Minimum | 37061558 |
|---|---|
| Maximum | 83709978 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 37061558 |
|---|---|
| 5-th percentile | 60184061 |
| Q1 | 64890920 |
| median | 70702210 |
| Q3 | 76693542 |
| 95-th percentile | 81205664 |
| Maximum | 83709978 |
| Range | 46648420 |
| Interquartile range (IQR) | 11802622 |
Descriptive statistics
| Standard deviation | 6794096.3 |
|---|---|
| Coefficient of variation (CV) | 0.096021717 |
| Kurtosis | -1.1802868 |
| Mean | 70755830 |
| Median Absolute Deviation (MAD) | 5901096 |
| Skewness | -0.0064540909 |
| Sum | 7.075583 × 1012 |
| Variance | 4.6159744 × 1013 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 62692882 | 3 | < 0.1% |
| 66248040 | 3 | < 0.1% |
| 71130028 | 3 | < 0.1% |
| 62405425 | 3 | < 0.1% |
| 66899557 | 3 | < 0.1% |
| 71267380 | 2 | < 0.1% |
| 68888703 | 2 | < 0.1% |
| 69133732 | 2 | < 0.1% |
| 70688762 | 2 | < 0.1% |
| 70022955 | 2 | < 0.1% |
| Other values (99650) | 99975 |
| Value | Count | Frequency (%) |
| 37061558 | 1 | |
| 37061956 | 1 | |
| 47521641 | 1 | |
| 47612297 | 1 | |
| 48131467 | 1 | |
| 48362674 | 1 | |
| 48531350 | 1 | |
| 48555146 | 1 | |
| 49009685 | 1 | |
| 49077562 | 1 |
| Value | Count | Frequency (%) |
| 83709978 | 1 | |
| 83706756 | 1 | |
| 83681569 | 1 | |
| 83680467 | 1 | |
| 83679398 | 1 | |
| 83660199 | 1 | |
| 83657858 | 1 | |
| 83637242 | 1 | |
| 83632874 | 1 | |
| 83632128 | 1 |
kkt_sn
Real number (ℝ)
| Distinct | 9092 |
|---|---|
| Distinct (%) | 9.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.0822448 × 1011 |
| Minimum | 1.0620555 × 1011 |
|---|---|
| Maximum | 1.0849 × 1011 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 1.0620555 × 1011 |
|---|---|
| 5-th percentile | 1.0820052 × 1011 |
| Q1 | 1.082028 × 1011 |
| median | 1.0820582 × 1011 |
| Q3 | 1.0820813 × 1011 |
| 95-th percentile | 1.084071 × 1011 |
| Maximum | 1.0849 × 1011 |
| Range | 2.2844521 × 109 |
| Interquartile range (IQR) | 5336120.2 |
Descriptive statistics
| Standard deviation | 1.1874324 × 108 |
|---|---|
| Coefficient of variation (CV) | 0.0010971939 |
| Kurtosis | 172.84681 |
| Mean | 1.0822448 × 1011 |
| Median Absolute Deviation (MAD) | 2679861 |
| Skewness | -10.476101 |
| Sum | 1.0822448 × 1016 |
| Variance | 1.4099958 × 1016 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1.084028694 × 1011 | 578 | 0.6% |
| 1.082022714 × 1011 | 528 | 0.5% |
| 1.084075406 × 1011 | 436 | 0.4% |
| 1.082020448 × 1011 | 406 | 0.4% |
| 1.084024938 × 1011 | 328 | 0.3% |
| 1.084075348 × 1011 | 273 | 0.3% |
| 1.08403534 × 1011 | 272 | 0.3% |
| 1.084075431 × 1011 | 271 | 0.3% |
| 1.084099522 × 1011 | 270 | 0.3% |
| 1.082016051 × 1011 | 255 | 0.3% |
| Other values (9082) | 96383 |
| Value | Count | Frequency (%) |
| 1.06205548 × 1011 | 172 | |
| 1.067027559 × 1011 | 21 | < 0.1% |
| 1.067035026 × 1011 | 48 | < 0.1% |
| 1.067058852 × 1011 | 48 | < 0.1% |
| 1.082 × 1011 | 5 | < 0.1% |
| 1.082 × 1011 | 1 | < 0.1% |
| 1.082000001 × 1011 | 2 | < 0.1% |
| 1.082000002 × 1011 | 2 | < 0.1% |
| 1.082000003 × 1011 | 9 | < 0.1% |
| 1.082000004 × 1011 | 3 | < 0.1% |
| Value | Count | Frequency (%) |
| 1.084900001 × 1011 | 1 | < 0.1% |
| 1.084099522 × 1011 | 270 | |
| 1.084098702 × 1011 | 11 | < 0.1% |
| 1.084098196 × 1011 | 215 | |
| 1.084098085 × 1011 | 5 | < 0.1% |
| 1.084097983 × 1011 | 2 | < 0.1% |
| 1.084097873 × 1011 | 7 | < 0.1% |
| 1.084097565 × 1011 | 5 | < 0.1% |
| 1.084097508 × 1011 | 1 | < 0.1% |
| 1.08409745 × 1011 | 1 | < 0.1% |
d_date
Date
| Distinct | 62 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| Minimum | 2018-06-01 00:00:00 |
|---|---|
| Maximum | 2018-08-01 00:00:00 |
name
Text
| Distinct | 11869 |
|---|---|
| Distinct (%) | 11.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
Length
| Max length | 128 |
|---|---|
| Median length | 123 |
| Mean length | 18.67016 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1867016 |
|---|---|
| Distinct characters | 154 |
| Distinct categories | 13 ? |
| Distinct scripts | 3 ? |
| Distinct blocks | 3 ? |
Unique
| Unique | 6658 ? |
|---|---|
| Unique (%) | 6.7% |
Sample
| 1st row | 0001 СТРОИТЕЛЬНЫЕ И ОТДЕЛОЧНЫЕ МАТЕРИАЛЫ |
|---|---|
| 2nd row | 0001 ТОВАР |
| 3rd row | 0001 ЭЛЕКТРОЭНЕРГИЯ |
| 4th row | 0001 ТОВАР |
| 5th row | 0001 ТОВАР ПО СВОБОДНОЙ ЦЕНЕ |
| Value | Count | Frequency (%) |
| 0001 | 63550 | 21.5% |
| товар | 29324 | 9.9% |
| 0002 | 8694 | 2.9% |
| изделия | 6263 | 2.1% |
| продукты | 4707 | 1.6% |
| и | 3149 | 1.1% |
| услуги | 2899 | 1.0% |
| товары | 2876 | 1.0% |
| 0003 | 2826 | 1.0% |
| в | 2781 | 0.9% |
| Other values (13237) | 168949 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 279427 | 15.0% |
| 202111 | 10.8% | |
| О | 88496 | 4.7% |
| А | 81265 | 4.4% |
| 1 | 78943 | 4.2% |
| Т | 68824 | 3.7% |
| Р | 62612 | 3.4% |
| Е | 55941 | 3.0% |
| И | 49092 | 2.6% |
| В | 46473 | 2.5% |
| Other values (144) | 853832 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 831786 | |
| Decimal Number | 426178 | |
| Lowercase Letter | 370314 | |
| Space Separator | 202111 | 10.8% |
| Other Punctuation | 26002 | 1.4% |
| Dash Punctuation | 5136 | 0.3% |
| Close Punctuation | 2462 | 0.1% |
| Open Punctuation | 2456 | 0.1% |
| Other Symbol | 307 | < 0.1% |
| Math Symbol | 210 | < 0.1% |
| Other values (3) | 54 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| О | 88496 | 10.6% |
| А | 81265 | 9.8% |
| Т | 68824 | 8.3% |
| Р | 62612 | 7.5% |
| Е | 55941 | 6.7% |
| И | 49092 | 5.9% |
| В | 46473 | 5.6% |
| Н | 37516 | 4.5% |
| Л | 36987 | 4.4% |
| С | 32371 | 3.9% |
| Other values (49) | 272209 |
Lowercase Letter
| Value | Count | Frequency (%) |
| о | 44005 | 11.9% |
| а | 34105 | 9.2% |
| е | 28903 | 7.8% |
| р | 26953 | 7.3% |
| и | 21567 | 5.8% |
| н | 19673 | 5.3% |
| к | 18391 | 5.0% |
| т | 17383 | 4.7% |
| с | 17340 | 4.7% |
| л | 16262 | 4.4% |
| Other values (49) | 125732 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 8497 | |
| " | 6044 | |
| , | 5831 | |
| / | 2848 | 11.0% |
| % | 2052 | 7.9% |
| * | 233 | 0.9% |
| : | 132 | 0.5% |
| \ | 120 | 0.5% |
| ? | 106 | 0.4% |
| ! | 47 | 0.2% |
| Other values (3) | 92 | 0.4% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 279427 | |
| 1 | 78943 | 18.5% |
| 2 | 18809 | 4.4% |
| 3 | 10586 | 2.5% |
| 5 | 10025 | 2.4% |
| 4 | 8366 | 2.0% |
| 6 | 5722 | 1.3% |
| 9 | 4817 | 1.1% |
| 8 | 4777 | 1.1% |
| 7 | 4706 | 1.1% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 126 | |
| ~ | 53 | |
| = | 20 | 9.5% |
| < | 10 | 4.8% |
| > | 1 | 0.5% |
Space Separator
| Value | Count | Frequency (%) |
| 202111 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 5136 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 2462 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 2456 |
Other Symbol
| Value | Count | Frequency (%) |
| № | 307 |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 24 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 23 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 7 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Cyrillic | 1184413 | |
| Common | 664916 | |
| Latin | 17687 | 0.9% |
Most frequent character per script
Cyrillic
| Value | Count | Frequency (%) |
| О | 88496 | 7.5% |
| А | 81265 | 6.9% |
| Т | 68824 | 5.8% |
| Р | 62612 | 5.3% |
| Е | 55941 | 4.7% |
| И | 49092 | 4.1% |
| В | 46473 | 3.9% |
| о | 44005 | 3.7% |
| Н | 37516 | 3.2% |
| Л | 36987 | 3.1% |
| Other values (56) | 613202 |
Latin
| Value | Count | Frequency (%) |
| e | 945 | 5.3% |
| i | 786 | 4.4% |
| o | 709 | 4.0% |
| S | 708 | 4.0% |
| l | 692 | 3.9% |
| a | 690 | 3.9% |
| E | 639 | 3.6% |
| s | 624 | 3.5% |
| R | 618 | 3.5% |
| m | 607 | 3.4% |
| Other values (42) | 10669 |
Common
| Value | Count | Frequency (%) |
| 0 | 279427 | |
| 202111 | ||
| 1 | 78943 | 11.9% |
| 2 | 18809 | 2.8% |
| 3 | 10586 | 1.6% |
| 5 | 10025 | 1.5% |
| . | 8497 | 1.3% |
| 4 | 8366 | 1.3% |
| " | 6044 | 0.9% |
| , | 5831 | 0.9% |
| Other values (26) | 36277 | 5.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| Cyrillic | 1184413 | |
| ASCII | 682296 | |
| Letterlike Symbols | 307 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 279427 | |
| 202111 | ||
| 1 | 78943 | 11.6% |
| 2 | 18809 | 2.8% |
| 3 | 10586 | 1.6% |
| 5 | 10025 | 1.5% |
| . | 8497 | 1.2% |
| 4 | 8366 | 1.2% |
| " | 6044 | 0.9% |
| , | 5831 | 0.9% |
| Other values (77) | 53657 | 7.9% |
Cyrillic
| Value | Count | Frequency (%) |
| О | 88496 | 7.5% |
| А | 81265 | 6.9% |
| Т | 68824 | 5.8% |
| Р | 62612 | 5.3% |
| Е | 55941 | 4.7% |
| И | 49092 | 4.1% |
| В | 46473 | 3.9% |
| о | 44005 | 3.7% |
| Н | 37516 | 3.2% |
| Л | 36987 | 3.1% |
| Other values (56) | 613202 |
Letterlike Symbols
| Value | Count | Frequency (%) |
| № | 307 |
amount
Real number (ℝ)
SKEWED 
| Distinct | 1412 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.3178273 |
| Minimum | 0.001 |
|---|---|
| Maximum | 8355 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0.001 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 8355 |
| Range | 8354.999 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 42.043469 |
|---|---|
| Coefficient of variation (CV) | 18.139173 |
| Kurtosis | 20463.272 |
| Mean | 2.3178273 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 122.48269 |
| Sum | 231782.73 |
| Variance | 1767.6533 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 91472 | |
| 2 | 2288 | 2.3% |
| 3 | 675 | 0.7% |
| 4 | 365 | 0.4% |
| 5 | 290 | 0.3% |
| 10 | 179 | 0.2% |
| 6 | 138 | 0.1% |
| 0.5 | 97 | 0.1% |
| 20 | 76 | 0.1% |
| 30 | 69 | 0.1% |
| Other values (1402) | 4351 | 4.4% |
| Value | Count | Frequency (%) |
| 0.001 | 10 | |
| 0.003 | 1 | < 0.1% |
| 0.007 | 1 | < 0.1% |
| 0.01 | 2 | < 0.1% |
| 0.012 | 1 | < 0.1% |
| 0.02 | 1 | < 0.1% |
| 0.021 | 1 | < 0.1% |
| 0.023 | 1 | < 0.1% |
| 0.026 | 1 | < 0.1% |
| 0.028 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 8355 | 1 | |
| 6000 | 1 | |
| 2935 | 1 | |
| 2371 | 1 | |
| 2285.53 | 1 | |
| 2224 | 1 | |
| 1979 | 1 | |
| 1966 | 1 | |
| 1950 | 1 | |
| 1925.6 | 1 |
unit
Text
| Distinct | 62 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
Length
| Max length | 15 |
|---|---|
| Median length | 2 |
| Mean length | 2.49388 |
| Min length | 1 |
Characters and Unicode
| Total characters | 249388 |
|---|---|
| Distinct characters | 44 |
| Distinct categories | 7 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 11 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | -- |
|---|---|
| 2nd row | -- |
| 3rd row | -- |
| 4th row | штука |
| 5th row | -- |
| Value | Count | Frequency (%) |
| 80094 | ||
| штука | 8925 | 8.9% |
| шт | 4327 | 4.3% |
| килограмм | 2905 | 2.9% |
| кг | 2652 | 2.7% |
| литр | 393 | 0.4% |
| порц | 128 | 0.1% |
| штук | 105 | 0.1% |
| метр | 93 | 0.1% |
| л | 77 | 0.1% |
| Other values (36) | 306 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| - | 159518 | |
| к | 14655 | 5.9% |
| т | 13883 | 5.6% |
| ш | 13126 | 5.3% |
| а | 11946 | 4.8% |
| у | 9185 | 3.7% |
| м | 5955 | 2.4% |
| г | 5593 | 2.2% |
| р | 3590 | 1.4% |
| л | 3463 | 1.4% |
| Other values (34) | 8474 | 3.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Dash Punctuation | 159518 | |
| Lowercase Letter | 88565 | |
| Other Punctuation | 979 | 0.4% |
| Uppercase Letter | 262 | 0.1% |
| Decimal Number | 57 | < 0.1% |
| Space Separator | 5 | < 0.1% |
| Connector Punctuation | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| к | 14655 | |
| т | 13883 | |
| ш | 13126 | |
| а | 11946 | |
| у | 9185 | |
| м | 5955 | |
| г | 5593 | 6.3% |
| р | 3590 | 4.1% |
| л | 3463 | 3.9% |
| и | 3310 | 3.7% |
| Other values (14) | 3859 | 4.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| Ш | 231 | |
| Л | 8 | 3.1% |
| Ч | 8 | 3.1% |
| У | 6 | 2.3% |
| К | 4 | 1.5% |
| М | 2 | 0.8% |
| Т | 2 | 0.8% |
| В | 1 | 0.4% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 31 | |
| 3 | 11 | 19.3% |
| 5 | 7 | 12.3% |
| 0 | 3 | 5.3% |
| 7 | 3 | 5.3% |
| 1 | 2 | 3.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 633 | |
| " | 333 | |
| / | 13 | 1.3% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 159518 |
Space Separator
| Value | Count | Frequency (%) |
| 5 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 160561 | |
| Cyrillic | 88827 |
Most frequent character per script
Cyrillic
| Value | Count | Frequency (%) |
| к | 14655 | |
| т | 13883 | |
| ш | 13126 | |
| а | 11946 | |
| у | 9185 | |
| м | 5955 | |
| г | 5593 | 6.3% |
| р | 3590 | 4.0% |
| л | 3463 | 3.9% |
| и | 3310 | 3.7% |
| Other values (22) | 4121 | 4.6% |
Common
| Value | Count | Frequency (%) |
| - | 159518 | |
| . | 633 | 0.4% |
| " | 333 | 0.2% |
| 2 | 31 | < 0.1% |
| / | 13 | < 0.1% |
| 3 | 11 | < 0.1% |
| 5 | 7 | < 0.1% |
| 5 | < 0.1% | |
| 0 | 3 | < 0.1% |
| 7 | 3 | < 0.1% |
| Other values (2) | 4 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 160561 | |
| Cyrillic | 88827 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| - | 159518 | |
| . | 633 | 0.4% |
| " | 333 | 0.2% |
| 2 | 31 | < 0.1% |
| / | 13 | < 0.1% |
| 3 | 11 | < 0.1% |
| 5 | 7 | < 0.1% |
| 5 | < 0.1% | |
| 0 | 3 | < 0.1% |
| 7 | 3 | < 0.1% |
| Other values (2) | 4 | < 0.1% |
Cyrillic
| Value | Count | Frequency (%) |
| к | 14655 | |
| т | 13883 | |
| ш | 13126 | |
| а | 11946 | |
| у | 9185 | |
| м | 5955 | |
| г | 5593 | 6.3% |
| р | 3590 | 4.0% |
| л | 3463 | 3.9% |
| и | 3310 | 3.7% |
| Other values (22) | 4121 | 4.6% |
price
Real number (ℝ)
HIGH CORRELATION  SKEWED 
| Distinct | 8998 |
|---|---|
| Distinct (%) | 9.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1415.7003 |
| Minimum | 0 |
|---|---|
| Maximum | 13230000 |
| Zeros | 79 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 15 |
| Q1 | 50 |
| median | 128 |
| Q3 | 379 |
| 95-th percentile | 3000 |
| Maximum | 13230000 |
| Range | 13230000 |
| Interquartile range (IQR) | 329 |
Descriptive statistics
| Standard deviation | 53215.042 |
|---|---|
| Coefficient of variation (CV) | 37.589199 |
| Kurtosis | 45560.746 |
| Mean | 1415.7003 |
| Median Absolute Deviation (MAD) | 98 |
| Skewness | 201.5919 |
| Sum | 1.4157003 × 108 |
| Variance | 2.8318407 × 109 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 100 | 2119 | 2.1% |
| 30 | 1908 | 1.9% |
| 50 | 1530 | 1.5% |
| 20 | 1490 | 1.5% |
| 150 | 1408 | 1.4% |
| 200 | 1257 | 1.3% |
| 60 | 1114 | 1.1% |
| 40 | 1098 | 1.1% |
| 90 | 1094 | 1.1% |
| 85 | 1070 | 1.1% |
| Other values (8988) | 85912 |
| Value | Count | Frequency (%) |
| 0 | 79 | |
| 0.01 | 7 | < 0.1% |
| 0.02 | 1 | < 0.1% |
| 0.04 | 5 | < 0.1% |
| 0.05 | 4 | < 0.1% |
| 0.07 | 1 | < 0.1% |
| 0.08 | 1 | < 0.1% |
| 0.09 | 2 | < 0.1% |
| 0.18 | 1 | < 0.1% |
| 0.21 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 13230000 | 1 | |
| 8704899.2 | 1 | |
| 3060000 | 1 | |
| 3000000 | 1 | |
| 1149579.6 | 1 | |
| 1070000 | 1 | |
| 1000000 | 1 | |
| 900000 | 1 | |
| 852550 | 1 | |
| 700000 | 1 |
cost
Real number (ℝ)
HIGH CORRELATION  SKEWED 
| Distinct | 11444 |
|---|---|
| Distinct (%) | 11.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1449.2929 |
| Minimum | 0 |
|---|---|
| Maximum | 13230000 |
| Zeros | 79 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 16.4 |
| Q1 | 54 |
| median | 135 |
| Q3 | 380.365 |
| 95-th percentile | 3000 |
| Maximum | 13230000 |
| Range | 13230000 |
| Interquartile range (IQR) | 326.365 |
Descriptive statistics
| Standard deviation | 53229.124 |
|---|---|
| Coefficient of variation (CV) | 36.72765 |
| Kurtosis | 45512.042 |
| Mean | 1449.2929 |
| Median Absolute Deviation (MAD) | 103 |
| Skewness | 201.43107 |
| Sum | 1.4492929 × 108 |
| Variance | 2.8333396 × 109 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 100 | 2088 | 2.1% |
| 30 | 1836 | 1.8% |
| 50 | 1503 | 1.5% |
| 20 | 1427 | 1.4% |
| 150 | 1406 | 1.4% |
| 200 | 1326 | 1.3% |
| 60 | 1172 | 1.2% |
| 300 | 1109 | 1.1% |
| 40 | 1074 | 1.1% |
| 90 | 1066 | 1.1% |
| Other values (11434) | 85993 |
| Value | Count | Frequency (%) |
| 0 | 79 | |
| 0.01 | 8 | < 0.1% |
| 0.02 | 1 | < 0.1% |
| 0.03 | 2 | < 0.1% |
| 0.04 | 7 | < 0.1% |
| 0.05 | 4 | < 0.1% |
| 0.06 | 1 | < 0.1% |
| 0.07 | 1 | < 0.1% |
| 0.08 | 1 | < 0.1% |
| 0.09 | 4 | < 0.1% |
| Value | Count | Frequency (%) |
| 13230000 | 1 | |
| 8704899.2 | 1 | |
| 3060000 | 1 | |
| 3000000 | 1 | |
| 1149579.6 | 1 | |
| 1070000 | 1 | |
| 1000000 | 1 | |
| 900000 | 1 | |
| 852550 | 1 | |
| 700000 | 1 |
nal
Real number (ℝ)
HIGH CORRELATION  SKEWED  ZEROS 
| Distinct | 10703 |
|---|---|
| Distinct (%) | 10.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1219.3027 |
| Minimum | 0 |
|---|---|
| Maximum | 1153213.6 |
| Zeros | 23409 |
| Zeros (%) | 23.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 14 |
| median | 100 |
| Q3 | 325 |
| 95-th percentile | 2365 |
| Maximum | 1153213.6 |
| Range | 1153213.6 |
| Interquartile range (IQR) | 311 |
Descriptive statistics
| Standard deviation | 13971.387 |
|---|---|
| Coefficient of variation (CV) | 11.458506 |
| Kurtosis | 1797.7944 |
| Mean | 1219.3027 |
| Median Absolute Deviation (MAD) | 100 |
| Skewness | 35.098368 |
| Sum | 1.2193027 × 108 |
| Variance | 1.9519966 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 23409 | 23.4% |
| 100 | 1529 | 1.5% |
| 30 | 1262 | 1.3% |
| 50 | 1085 | 1.1% |
| 150 | 1019 | 1.0% |
| 200 | 990 | 1.0% |
| 20 | 972 | 1.0% |
| 300 | 842 | 0.8% |
| 60 | 812 | 0.8% |
| 90 | 730 | 0.7% |
| Other values (10693) | 67350 |
| Value | Count | Frequency (%) |
| 0 | 23409 | |
| 0.01 | 8 | < 0.1% |
| 0.02 | 1 | < 0.1% |
| 0.07 | 1 | < 0.1% |
| 0.08 | 1 | < 0.1% |
| 0.09 | 2 | < 0.1% |
| 0.15 | 1 | < 0.1% |
| 0.25 | 1 | < 0.1% |
| 0.5 | 1 | < 0.1% |
| 0.7 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 1153213.6 | 1 | |
| 1070000 | 1 | |
| 1000000 | 1 | |
| 852550 | 1 | |
| 700000 | 1 | |
| 640601 | 1 | |
| 602382.2 | 1 | |
| 582194.97 | 1 | |
| 578787.8 | 1 | |
| 578735.35 | 1 |
electron
Real number (ℝ)
HIGH CORRELATION  SKEWED  ZEROS 
| Distinct | 6757 |
|---|---|
| Distinct (%) | 6.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 677.11432 |
| Minimum | 0 |
|---|---|
| Maximum | 13230000 |
| Zeros | 76671 |
| Zeros (%) | 76.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1035 |
| Maximum | 13230000 |
| Range | 13230000 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 52182.107 |
|---|---|
| Coefficient of variation (CV) | 77.065431 |
| Kurtosis | 49281.549 |
| Mean | 677.11432 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 213.38221 |
| Sum | 67711432 |
| Variance | 2.7229723 × 109 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 76671 | |
| 100 | 242 | 0.2% |
| 150 | 223 | 0.2% |
| 200 | 210 | 0.2% |
| 300 | 179 | 0.2% |
| 500 | 168 | 0.2% |
| 120 | 159 | 0.2% |
| 90 | 152 | 0.2% |
| 130 | 142 | 0.1% |
| 400 | 139 | 0.1% |
| Other values (6747) | 21715 | 21.7% |
| Value | Count | Frequency (%) |
| 0 | 76671 | |
| 0.21 | 2 | < 0.1% |
| 1 | 33 | < 0.1% |
| 1.5 | 1 | < 0.1% |
| 2 | 30 | < 0.1% |
| 2.5 | 1 | < 0.1% |
| 3 | 4 | < 0.1% |
| 4 | 5 | < 0.1% |
| 5 | 8 | < 0.1% |
| 5.7 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 13230000 | 1 | |
| 8704899.2 | 1 | |
| 3060000 | 1 | |
| 3000000 | 1 | |
| 900000 | 1 | |
| 605715 | 1 | |
| 525000 | 1 | |
| 431775.43 | 1 | |
| 400000 | 1 | |
| 365411.25 | 1 |
avans
Real number (ℝ)
SKEWED  ZEROS 
| Distinct | 56 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.7710837 |
| Minimum | 0 |
|---|---|
| Maximum | 70000 |
| Zeros | 99924 |
| Zeros (%) | 99.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 70000 |
| Range | 70000 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 235.12207 |
|---|---|
| Coefficient of variation (CV) | 132.75605 |
| Kurtosis | 79091.766 |
| Mean | 1.7710837 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 270.65209 |
| Sum | 177108.37 |
| Variance | 55282.389 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 99924 | |
| 10 | 6 | < 0.1% |
| 900 | 4 | < 0.1% |
| 800 | 4 | < 0.1% |
| 3320 | 3 | < 0.1% |
| 5000 | 3 | < 0.1% |
| 340 | 2 | < 0.1% |
| 180 | 2 | < 0.1% |
| 510 | 2 | < 0.1% |
| 410 | 2 | < 0.1% |
| Other values (46) | 48 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 99924 | |
| 2 | 1 | < 0.1% |
| 10 | 6 | < 0.1% |
| 100 | 1 | < 0.1% |
| 150 | 1 | < 0.1% |
| 155 | 1 | < 0.1% |
| 180 | 2 | < 0.1% |
| 219 | 1 | < 0.1% |
| 220 | 1 | < 0.1% |
| 236.34 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 70000 | 1 | < 0.1% |
| 20000 | 1 | < 0.1% |
| 5000 | 3 | |
| 4000 | 1 | < 0.1% |
| 3710 | 1 | < 0.1% |
| 3635 | 1 | < 0.1% |
| 3320 | 3 | |
| 3200 | 1 | < 0.1% |
| 3000 | 2 | |
| 2750 | 1 | < 0.1% |
credit
Real number (ℝ)
SKEWED  ZEROS 
| Distinct | 32 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.9035706 |
| Minimum | 0 |
|---|---|
| Maximum | 77001 |
| Zeros | 99961 |
| Zeros (%) | > 99.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 77001 |
| Range | 77001 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 421.05115 |
|---|---|
| Coefficient of variation (CV) | 107.86308 |
| Kurtosis | 20090.766 |
| Mean | 3.9035706 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 134.3508 |
| Sum | 390357.06 |
| Variance | 177284.07 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 99961 | |
| 5 | 6 | < 0.1% |
| 100 | 3 | < 0.1% |
| 160 | 2 | < 0.1% |
| 31940 | 1 | < 0.1% |
| 90 | 1 | < 0.1% |
| 2000 | 1 | < 0.1% |
| 400 | 1 | < 0.1% |
| 250 | 1 | < 0.1% |
| 15294 | 1 | < 0.1% |
| Other values (22) | 22 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 99961 | |
| 5 | 6 | < 0.1% |
| 90 | 1 | < 0.1% |
| 95 | 1 | < 0.1% |
| 100 | 3 | < 0.1% |
| 115 | 1 | < 0.1% |
| 121 | 1 | < 0.1% |
| 135 | 1 | < 0.1% |
| 145 | 1 | < 0.1% |
| 160 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 77001 | 1 | |
| 63000 | 1 | |
| 51000 | 1 | |
| 40664 | 1 | |
| 31940 | 1 | |
| 29886 | 1 | |
| 28915 | 1 | |
| 19063 | 1 | |
| 15294 | 1 | |
| 13498 | 1 |
vstrechpredst
Real number (ℝ)
SKEWED  ZEROS 
| Distinct | 16 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.969505 |
| Minimum | 0 |
|---|---|
| Maximum | 28502 |
| Zeros | 99976 |
| Zeros (%) | > 99.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 28502 |
| Range | 28502 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 115.38246 |
|---|---|
| Coefficient of variation (CV) | 119.01173 |
| Kurtosis | 40966.656 |
| Mean | 0.969505 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 185.77218 |
| Sum | 96950.5 |
| Variance | 13313.113 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 99976 | |
| 5 | 6 | < 0.1% |
| 1550 | 3 | < 0.1% |
| 4000 | 2 | < 0.1% |
| 6000 | 2 | < 0.1% |
| 959.5 | 1 | < 0.1% |
| 5000 | 1 | < 0.1% |
| 10 | 1 | < 0.1% |
| 3100 | 1 | < 0.1% |
| 2000 | 1 | < 0.1% |
| Other values (6) | 6 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 99976 | |
| 5 | 6 | < 0.1% |
| 10 | 1 | < 0.1% |
| 199 | 1 | < 0.1% |
| 959.5 | 1 | < 0.1% |
| 1000 | 1 | < 0.1% |
| 1550 | 3 | < 0.1% |
| 2000 | 1 | < 0.1% |
| 3100 | 1 | < 0.1% |
| 4000 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 28502 | 1 | < 0.1% |
| 15000 | 1 | < 0.1% |
| 10000 | 1 | < 0.1% |
| 6500 | 1 | < 0.1% |
| 6000 | 2 | |
| 5000 | 1 | < 0.1% |
| 4000 | 2 | |
| 3100 | 1 | < 0.1% |
| 2000 | 1 | < 0.1% |
| 1550 | 3 |
Тип региона
Categorical
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 219 |
| Missing (%) | 0.2% |
| Memory size | 1.5 MiB |
| обл | |
|---|---|
| край | |
| Респ | |
| г | |
| 0 | 2780 |
| Other values (2) | 245 |
Length
| Max length | 4 |
|---|---|
| Median length | 3 |
| Mean length | 3.1412193 |
| Min length | 1 |
Characters and Unicode
| Total characters | 313434 |
|---|---|
| Distinct characters | 15 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | обл |
|---|---|
| 2nd row | обл |
| 3rd row | Респ |
| 4th row | обл |
| 5th row | край |
Common Values
| Value | Count | Frequency (%) |
| обл | 56440 | |
| край | 20728 | 20.7% |
| Респ | 12778 | 12.8% |
| г | 6810 | 6.8% |
| 0 | 2780 | 2.8% |
| АО | 240 | 0.2% |
| Аобл | 5 | < 0.1% |
| (Missing) | 219 | 0.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| обл | 56440 | |
| край | 20728 | 20.8% |
| респ | 12778 | 12.8% |
| г | 6810 | 6.8% |
| 0 | 2780 | 2.8% |
| ао | 240 | 0.2% |
| аобл | 5 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| о | 56445 | |
| б | 56445 | |
| л | 56445 | |
| к | 20728 | 6.6% |
| р | 20728 | 6.6% |
| а | 20728 | 6.6% |
| й | 20728 | 6.6% |
| Р | 12778 | 4.1% |
| е | 12778 | 4.1% |
| с | 12778 | 4.1% |
| Other values (5) | 22853 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 297391 | |
| Uppercase Letter | 13263 | 4.2% |
| Decimal Number | 2780 | 0.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| о | 56445 | |
| б | 56445 | |
| л | 56445 | |
| к | 20728 | 7.0% |
| р | 20728 | 7.0% |
| а | 20728 | 7.0% |
| й | 20728 | 7.0% |
| е | 12778 | 4.3% |
| с | 12778 | 4.3% |
| п | 12778 | 4.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| Р | 12778 | |
| А | 245 | 1.8% |
| О | 240 | 1.8% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2780 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Cyrillic | 310654 | |
| Common | 2780 | 0.9% |
Most frequent character per script
Cyrillic
| Value | Count | Frequency (%) |
| о | 56445 | |
| б | 56445 | |
| л | 56445 | |
| к | 20728 | 6.7% |
| р | 20728 | 6.7% |
| а | 20728 | 6.7% |
| й | 20728 | 6.7% |
| Р | 12778 | 4.1% |
| е | 12778 | 4.1% |
| с | 12778 | 4.1% |
| Other values (4) | 20073 | 6.5% |
Common
| Value | Count | Frequency (%) |
| 0 | 2780 |
Most occurring blocks
| Value | Count | Frequency (%) |
| Cyrillic | 310654 | |
| ASCII | 2780 | 0.9% |
Most frequent character per block
Cyrillic
| Value | Count | Frequency (%) |
| о | 56445 | |
| б | 56445 | |
| л | 56445 | |
| к | 20728 | 6.7% |
| р | 20728 | 6.7% |
| а | 20728 | 6.7% |
| й | 20728 | 6.7% |
| Р | 12778 | 4.1% |
| е | 12778 | 4.1% |
| с | 12778 | 4.1% |
| Other values (4) | 20073 | 6.5% |
ASCII
| Value | Count | Frequency (%) |
| 0 | 2780 |
Имя региона
Text
| Distinct | 86 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 219 |
| Missing (%) | 0.2% |
| Memory size | 1.5 MiB |
Length
| Max length | 40 |
|---|---|
| Median length | 19 |
| Mean length | 10.321825 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1029922 |
|---|---|
| Distinct characters | 56 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Курганская |
|---|---|
| 2nd row | Тверская |
| 3rd row | Кабардино-Балкарская |
| 4th row | Иркутская |
| 5th row | Ставропольский |
| Value | Count | Frequency (%) |
| челябинская | 7388 | 7.2% |
| хабаровский | 5110 | 5.0% |
| краснодарский | 4635 | 4.5% |
| свердловская | 3656 | 3.6% |
| тульская | 2969 | 2.9% |
| новосибирская | 2781 | 2.7% |
| 0 | 2780 | 2.7% |
| пензенская | 2551 | 2.5% |
| санкт-петербург | 2376 | 2.3% |
| московская | 2367 | 2.3% |
| Other values (85) | 65835 |
Most occurring characters
| Value | Count | Frequency (%) |
| а | 131299 | |
| с | 102973 | 10.0% |
| к | 93565 | 9.1% |
| я | 73852 | 7.2% |
| р | 72341 | 7.0% |
| о | 71740 | 7.0% |
| и | 49571 | 4.8% |
| е | 40498 | 3.9% |
| в | 39113 | 3.8% |
| н | 36656 | 3.6% |
| Other values (46) | 318314 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 916372 | |
| Uppercase Letter | 102407 | 9.9% |
| Dash Punctuation | 3464 | 0.3% |
| Decimal Number | 2780 | 0.3% |
| Space Separator | 2667 | 0.3% |
| Open Punctuation | 951 | 0.1% |
| Close Punctuation | 951 | 0.1% |
| Other Punctuation | 330 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| а | 131299 | |
| с | 102973 | |
| к | 93565 | |
| я | 73852 | 8.1% |
| р | 72341 | 7.9% |
| о | 71740 | 7.8% |
| и | 49571 | 5.4% |
| е | 40498 | 4.4% |
| в | 39113 | 4.3% |
| н | 36656 | 4.0% |
| Other values (18) | 204764 |
Uppercase Letter
| Value | Count | Frequency (%) |
| К | 19071 | |
| С | 13959 | |
| Ч | 9473 | |
| П | 9143 | |
| Т | 8317 | |
| Х | 5853 | 5.7% |
| М | 5607 | 5.5% |
| Н | 4937 | 4.8% |
| Б | 4063 | 4.0% |
| В | 3967 | 3.9% |
| Other values (12) | 18017 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3464 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2780 |
Space Separator
| Value | Count | Frequency (%) |
| 2667 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 951 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 951 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 330 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Cyrillic | 1018779 | |
| Common | 11143 | 1.1% |
Most frequent character per script
Cyrillic
| Value | Count | Frequency (%) |
| а | 131299 | |
| с | 102973 | 10.1% |
| к | 93565 | 9.2% |
| я | 73852 | 7.2% |
| р | 72341 | 7.1% |
| о | 71740 | 7.0% |
| и | 49571 | 4.9% |
| е | 40498 | 4.0% |
| в | 39113 | 3.8% |
| н | 36656 | 3.6% |
| Other values (40) | 307171 |
Common
| Value | Count | Frequency (%) |
| - | 3464 | |
| 0 | 2780 | |
| 2667 | ||
| ( | 951 | 8.5% |
| ) | 951 | 8.5% |
| / | 330 | 3.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| Cyrillic | 1018779 | |
| ASCII | 11143 | 1.1% |
Most frequent character per block
Cyrillic
| Value | Count | Frequency (%) |
| а | 131299 | |
| с | 102973 | 10.1% |
| к | 93565 | 9.2% |
| я | 73852 | 7.2% |
| р | 72341 | 7.1% |
| о | 71740 | 7.0% |
| и | 49571 | 4.9% |
| е | 40498 | 4.0% |
| в | 39113 | 3.8% |
| н | 36656 | 3.6% |
| Other values (40) | 307171 |
ASCII
| Value | Count | Frequency (%) |
| - | 3464 | |
| 0 | 2780 | |
| 2667 | ||
| ( | 951 | 8.5% |
| ) | 951 | 8.5% |
| / | 330 | 3.0% |
Тип города
Categorical
IMBALANCE 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 219 |
| Missing (%) | 0.2% |
| Memory size | 1.5 MiB |
| г | |
|---|---|
| 0 | |
| пгт | 65 |
| с/п | 13 |
| рп | 10 |
Length
| Max length | 6 |
|---|---|
| Median length | 1 |
| Mean length | 1.0017639 |
| Min length | 1 |
Characters and Unicode
| Total characters | 99957 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | г |
|---|---|
| 2nd row | г |
| 3rd row | г |
| 4th row | г |
| 5th row | г |
Common Values
| Value | Count | Frequency (%) |
| г | 79369 | |
| 0 | 20322 | 20.3% |
| пгт | 65 | 0.1% |
| с/п | 13 | < 0.1% |
| рп | 10 | < 0.1% |
| массив | 2 | < 0.1% |
| (Missing) | 219 | 0.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| г | 79369 | |
| 0 | 20322 | 20.4% |
| пгт | 65 | 0.1% |
| с/п | 13 | < 0.1% |
| рп | 10 | < 0.1% |
| массив | 2 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| г | 79434 | |
| 0 | 20322 | 20.3% |
| п | 88 | 0.1% |
| т | 65 | 0.1% |
| с | 17 | < 0.1% |
| / | 13 | < 0.1% |
| р | 10 | < 0.1% |
| м | 2 | < 0.1% |
| а | 2 | < 0.1% |
| и | 2 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 79622 | |
| Decimal Number | 20322 | 20.3% |
| Other Punctuation | 13 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| г | 79434 | |
| п | 88 | 0.1% |
| т | 65 | 0.1% |
| с | 17 | < 0.1% |
| р | 10 | < 0.1% |
| м | 2 | < 0.1% |
| а | 2 | < 0.1% |
| и | 2 | < 0.1% |
| в | 2 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 20322 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 13 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Cyrillic | 79622 | |
| Common | 20335 | 20.3% |
Most frequent character per script
Cyrillic
| Value | Count | Frequency (%) |
| г | 79434 | |
| п | 88 | 0.1% |
| т | 65 | 0.1% |
| с | 17 | < 0.1% |
| р | 10 | < 0.1% |
| м | 2 | < 0.1% |
| а | 2 | < 0.1% |
| и | 2 | < 0.1% |
| в | 2 | < 0.1% |
Common
| Value | Count | Frequency (%) |
| 0 | 20322 | |
| / | 13 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| Cyrillic | 79622 | |
| ASCII | 20335 | 20.3% |
Most frequent character per block
Cyrillic
| Value | Count | Frequency (%) |
| г | 79434 | |
| п | 88 | 0.1% |
| т | 65 | 0.1% |
| с | 17 | < 0.1% |
| р | 10 | < 0.1% |
| м | 2 | < 0.1% |
| а | 2 | < 0.1% |
| и | 2 | < 0.1% |
| в | 2 | < 0.1% |
ASCII
| Value | Count | Frequency (%) |
| 0 | 20322 | |
| / | 13 | 0.1% |
Имя города
Text
| Distinct | 701 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 219 |
| Missing (%) | 0.2% |
| Memory size | 1.5 MiB |
Length
| Max length | 25 |
|---|---|
| Median length | 21 |
| Mean length | 6.7247572 |
| Min length | 1 |
Characters and Unicode
| Total characters | 671003 |
|---|---|
| Distinct characters | 65 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 64 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Курган |
|---|---|
| 2nd row | Торжок |
| 3rd row | Баксан |
| 4th row | Бодайбо |
| 5th row | Ставрополь |
| Value | Count | Frequency (%) |
| 0 | 20322 | 19.6% |
| хабаровск | 4186 | 4.0% |
| челябинск | 2483 | 2.4% |
| новосибирск | 2424 | 2.3% |
| тула | 2241 | 2.2% |
| чита | 1688 | 1.6% |
| екатеринбург | 1589 | 1.5% |
| пермь | 1386 | 1.3% |
| нижний | 1380 | 1.3% |
| новокузнецк | 1380 | 1.3% |
| Other values (716) | 64424 |
Most occurring characters
| Value | Count | Frequency (%) |
| а | 59128 | 8.8% |
| о | 56136 | 8.4% |
| р | 47007 | 7.0% |
| с | 45473 | 6.8% |
| к | 45318 | 6.8% |
| и | 37494 | 5.6% |
| н | 35303 | 5.3% |
| е | 31699 | 4.7% |
| л | 27808 | 4.1% |
| в | 23639 | 3.5% |
| Other values (55) | 261998 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 558088 | |
| Uppercase Letter | 85717 | 12.8% |
| Decimal Number | 20323 | 3.0% |
| Space Separator | 3722 | 0.6% |
| Dash Punctuation | 3153 | 0.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| а | 59128 | 10.6% |
| о | 56136 | 10.1% |
| р | 47007 | 8.4% |
| с | 45473 | 8.1% |
| к | 45318 | 8.1% |
| и | 37494 | 6.7% |
| н | 35303 | 6.3% |
| е | 31699 | 5.7% |
| л | 27808 | 5.0% |
| в | 23639 | 4.2% |
| Other values (22) | 149083 |
Uppercase Letter
| Value | Count | Frequency (%) |
| К | 11849 | |
| Н | 9265 | 10.8% |
| С | 6273 | 7.3% |
| Ч | 6093 | 7.1% |
| Т | 5425 | 6.3% |
| В | 5372 | 6.3% |
| П | 4487 | 5.2% |
| Х | 4363 | 5.1% |
| О | 3987 | 4.7% |
| Б | 3796 | 4.4% |
| Other values (19) | 24807 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 20322 | |
| 2 | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 3722 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3153 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Cyrillic | 643805 | |
| Common | 27198 | 4.1% |
Most frequent character per script
Cyrillic
| Value | Count | Frequency (%) |
| а | 59128 | 9.2% |
| о | 56136 | 8.7% |
| р | 47007 | 7.3% |
| с | 45473 | 7.1% |
| к | 45318 | 7.0% |
| и | 37494 | 5.8% |
| н | 35303 | 5.5% |
| е | 31699 | 4.9% |
| л | 27808 | 4.3% |
| в | 23639 | 3.7% |
| Other values (51) | 234800 |
Common
| Value | Count | Frequency (%) |
| 0 | 20322 | |
| 3722 | 13.7% | |
| - | 3153 | 11.6% |
| 2 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| Cyrillic | 643805 | |
| ASCII | 27198 | 4.1% |
Most frequent character per block
Cyrillic
| Value | Count | Frequency (%) |
| а | 59128 | 9.2% |
| о | 56136 | 8.7% |
| р | 47007 | 7.3% |
| с | 45473 | 7.1% |
| к | 45318 | 7.0% |
| и | 37494 | 5.8% |
| н | 35303 | 5.5% |
| е | 31699 | 4.9% |
| л | 27808 | 4.3% |
| в | 23639 | 3.7% |
| Other values (51) | 234800 |
ASCII
| Value | Count | Frequency (%) |
| 0 | 20322 | |
| 3722 | 13.7% | |
| - | 3153 | 11.6% |
| 2 | 1 | < 0.1% |
Тип улицы
Categorical
IMBALANCE 
| Distinct | 26 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 219 |
| Missing (%) | 0.2% |
| Memory size | 1.5 MiB |
| ул | |
|---|---|
| пр-кт | |
| 0 | 4672 |
| ш | 2649 |
| пер | 1510 |
| Other values (21) | 5283 |
Length
| Max length | 7 |
|---|---|
| Median length | 2 |
| Mean length | 2.3023522 |
| Min length | 1 |
Characters and Unicode
| Total characters | 229731 |
|---|---|
| Distinct characters | 22 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | ул |
|---|---|
| 2nd row | ул |
| 3rd row | пр-кт |
| 4th row | ул |
| 5th row | ул |
Common Values
| Value | Count | Frequency (%) |
| ул | 76000 | |
| пр-кт | 9667 | 9.7% |
| 0 | 4672 | 4.7% |
| ш | 2649 | 2.6% |
| пер | 1510 | 1.5% |
| пл | 1497 | 1.5% |
| мкр | 992 | 1.0% |
| проезд | 853 | 0.9% |
| б-р | 739 | 0.7% |
| км | 375 | 0.4% |
| Other values (16) | 827 | 0.8% |
Length
| Value | Count | Frequency (%) |
| ул | 76000 | |
| пр-кт | 9667 | 9.7% |
| 0 | 4672 | 4.7% |
| ш | 2649 | 2.7% |
| пер | 1510 | 1.5% |
| пл | 1497 | 1.5% |
| мкр | 992 | 1.0% |
| проезд | 853 | 0.9% |
| б-р | 739 | 0.7% |
| км | 375 | 0.4% |
| Other values (16) | 827 | 0.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| л | 77755 | |
| у | 76056 | |
| р | 14155 | 6.2% |
| п | 13680 | 6.0% |
| к | 11486 | 5.0% |
| - | 10476 | 4.6% |
| т | 10372 | 4.5% |
| 0 | 4672 | 2.0% |
| ш | 2649 | 1.2% |
| е | 2483 | 1.1% |
| Other values (12) | 5947 | 2.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 214580 | |
| Dash Punctuation | 10476 | 4.6% |
| Decimal Number | 4672 | 2.0% |
| Other Punctuation | 3 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| л | 77755 | |
| у | 76056 | |
| р | 14155 | 6.6% |
| п | 13680 | 6.4% |
| к | 11486 | 5.4% |
| т | 10372 | 4.8% |
| ш | 2649 | 1.2% |
| е | 2483 | 1.2% |
| м | 1367 | 0.6% |
| д | 887 | 0.4% |
| Other values (9) | 3690 | 1.7% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 10476 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 4672 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Cyrillic | 214580 | |
| Common | 15151 | 6.6% |
Most frequent character per script
Cyrillic
| Value | Count | Frequency (%) |
| л | 77755 | |
| у | 76056 | |
| р | 14155 | 6.6% |
| п | 13680 | 6.4% |
| к | 11486 | 5.4% |
| т | 10372 | 4.8% |
| ш | 2649 | 1.2% |
| е | 2483 | 1.2% |
| м | 1367 | 0.6% |
| д | 887 | 0.4% |
| Other values (9) | 3690 | 1.7% |
Common
| Value | Count | Frequency (%) |
| - | 10476 | |
| 0 | 4672 | |
| / | 3 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| Cyrillic | 214580 | |
| ASCII | 15151 | 6.6% |
Most frequent character per block
Cyrillic
| Value | Count | Frequency (%) |
| л | 77755 | |
| у | 76056 | |
| р | 14155 | 6.6% |
| п | 13680 | 6.4% |
| к | 11486 | 5.4% |
| т | 10372 | 4.8% |
| ш | 2649 | 1.2% |
| е | 2483 | 1.2% |
| м | 1367 | 0.6% |
| д | 887 | 0.4% |
| Other values (9) | 3690 | 1.7% |
ASCII
| Value | Count | Frequency (%) |
| - | 10476 | |
| 0 | 4672 | |
| / | 3 | < 0.1% |
Имя улицы
Text
| Distinct | 2688 |
|---|---|
| Distinct (%) | 2.7% |
| Missing | 219 |
| Missing (%) | 0.2% |
| Memory size | 1.5 MiB |
Length
| Max length | 40 |
|---|---|
| Median length | 30 |
| Mean length | 9.6897906 |
| Min length | 1 |
Characters and Unicode
| Total characters | 966857 |
|---|---|
| Distinct characters | 80 |
| Distinct categories | 8 ? |
| Distinct scripts | 3 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 470 ? |
|---|---|
| Unique (%) | 0.5% |
Sample
| 1st row | Коли Мяготина |
|---|---|
| 2nd row | Луначарского |
| 3rd row | Ленина |
| 4th row | Карла Либкнехта |
| 5th row | Ленина |
| Value | Count | Frequency (%) |
| ленина | 5494 | 4.6% |
| 0 | 4672 | 3.9% |
| советская | 2169 | 1.8% |
| им | 1682 | 1.4% |
| мира | 1597 | 1.3% |
| карла | 1531 | 1.3% |
| победы | 1454 | 1.2% |
| маркса | 1397 | 1.2% |
| октября | 1140 | 1.0% |
| октябрьская | 1069 | 0.9% |
| Other values (2716) | 96664 |
Most occurring characters
| Value | Count | Frequency (%) |
| а | 120693 | 12.5% |
| о | 92737 | 9.6% |
| е | 59998 | 6.2% |
| н | 58006 | 6.0% |
| к | 53750 | 5.6% |
| р | 52138 | 5.4% |
| и | 49990 | 5.2% |
| с | 46892 | 4.8% |
| в | 45781 | 4.7% |
| я | 39826 | 4.1% |
| Other values (70) | 347046 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 818858 | |
| Uppercase Letter | 109037 | 11.3% |
| Space Separator | 19088 | 2.0% |
| Decimal Number | 13183 | 1.4% |
| Other Punctuation | 3575 | 0.4% |
| Dash Punctuation | 3034 | 0.3% |
| Open Punctuation | 41 | < 0.1% |
| Close Punctuation | 41 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| а | 120693 | |
| о | 92737 | |
| е | 59998 | 7.3% |
| н | 58006 | 7.1% |
| к | 53750 | 6.6% |
| р | 52138 | 6.4% |
| и | 49990 | 6.1% |
| с | 46892 | 5.7% |
| в | 45781 | 5.6% |
| я | 39826 | 4.9% |
| Other values (23) | 199047 |
Uppercase Letter
| Value | Count | Frequency (%) |
| К | 14032 | |
| П | 10590 | 9.7% |
| М | 10259 | 9.4% |
| С | 9912 | 9.1% |
| Л | 9209 | 8.4% |
| Б | 6950 | 6.4% |
| Г | 5644 | 5.2% |
| В | 4813 | 4.4% |
| О | 4615 | 4.2% |
| Т | 3670 | 3.4% |
| Other values (20) | 29343 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 6562 | |
| 2 | 1524 | 11.6% |
| 5 | 1075 | 8.2% |
| 1 | 915 | 6.9% |
| 4 | 864 | 6.6% |
| 9 | 727 | 5.5% |
| 3 | 639 | 4.8% |
| 6 | 461 | 3.5% |
| 8 | 318 | 2.4% |
| 7 | 98 | 0.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 3464 | |
| ' | 96 | 2.7% |
| / | 15 | 0.4% |
Space Separator
| Value | Count | Frequency (%) |
| 19088 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3034 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 41 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 41 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Cyrillic | 927894 | |
| Common | 38962 | 4.0% |
| Latin | 1 | < 0.1% |
Most frequent character per script
Cyrillic
| Value | Count | Frequency (%) |
| а | 120693 | 13.0% |
| о | 92737 | 10.0% |
| е | 59998 | 6.5% |
| н | 58006 | 6.3% |
| к | 53750 | 5.8% |
| р | 52138 | 5.6% |
| и | 49990 | 5.4% |
| с | 46892 | 5.1% |
| в | 45781 | 4.9% |
| я | 39826 | 4.3% |
| Other values (52) | 308083 |
Common
| Value | Count | Frequency (%) |
| 19088 | ||
| 0 | 6562 | 16.8% |
| . | 3464 | 8.9% |
| - | 3034 | 7.8% |
| 2 | 1524 | 3.9% |
| 5 | 1075 | 2.8% |
| 1 | 915 | 2.3% |
| 4 | 864 | 2.2% |
| 9 | 727 | 1.9% |
| 3 | 639 | 1.6% |
| Other values (7) | 1070 | 2.7% |
Latin
| Value | Count | Frequency (%) |
| N | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| Cyrillic | 927894 | |
| ASCII | 38963 | 4.0% |
Most frequent character per block
Cyrillic
| Value | Count | Frequency (%) |
| а | 120693 | 13.0% |
| о | 92737 | 10.0% |
| е | 59998 | 6.5% |
| н | 58006 | 6.3% |
| к | 53750 | 5.8% |
| р | 52138 | 5.6% |
| и | 49990 | 5.4% |
| с | 46892 | 5.1% |
| в | 45781 | 4.9% |
| я | 39826 | 4.3% |
| Other values (52) | 308083 |
ASCII
| Value | Count | Frequency (%) |
| 19088 | ||
| 0 | 6562 | 16.8% |
| . | 3464 | 8.9% |
| - | 3034 | 7.8% |
| 2 | 1524 | 3.9% |
| 5 | 1075 | 2.8% |
| 1 | 915 | 2.3% |
| 4 | 864 | 2.2% |
| 9 | 727 | 1.9% |
| 3 | 639 | 1.6% |
| Other values (8) | 1071 | 2.7% |
Тип номера дома
Categorical
IMBALANCE 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 219 |
| Missing (%) | 0.2% |
| Memory size | 1.5 MiB |
| дом | |
|---|---|
| 0 | |
| в/ч | 15 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.7983183 |
| Min length | 1 |
Characters and Unicode
| Total characters | 279219 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | дом |
|---|---|
| 2nd row | дом |
| 3rd row | 0 |
| 4th row | дом |
| 5th row | дом |
Common Values
| Value | Count | Frequency (%) |
| дом | 89704 | |
| 0 | 10062 | 10.1% |
| в/ч | 15 | < 0.1% |
| (Missing) | 219 | 0.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| дом | 89704 | |
| 0 | 10062 | 10.1% |
| в/ч | 15 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| д | 89704 | |
| о | 89704 | |
| м | 89704 | |
| 0 | 10062 | 3.6% |
| в | 15 | < 0.1% |
| / | 15 | < 0.1% |
| ч | 15 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 269142 | |
| Decimal Number | 10062 | 3.6% |
| Other Punctuation | 15 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| д | 89704 | |
| о | 89704 | |
| м | 89704 | |
| в | 15 | < 0.1% |
| ч | 15 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 10062 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 15 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Cyrillic | 269142 | |
| Common | 10077 | 3.6% |
Most frequent character per script
Cyrillic
| Value | Count | Frequency (%) |
| д | 89704 | |
| о | 89704 | |
| м | 89704 | |
| в | 15 | < 0.1% |
| ч | 15 | < 0.1% |
Common
| Value | Count | Frequency (%) |
| 0 | 10062 | |
| / | 15 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| Cyrillic | 269142 | |
| ASCII | 10077 | 3.6% |
Most frequent character per block
Cyrillic
| Value | Count | Frequency (%) |
| д | 89704 | |
| о | 89704 | |
| м | 89704 | |
| в | 15 | < 0.1% |
| ч | 15 | < 0.1% |
ASCII
| Value | Count | Frequency (%) |
| 0 | 10062 | |
| / | 15 | 0.1% |
Номер дома
Text
| Distinct | 1071 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 219 |
| Missing (%) | 0.2% |
| Memory size | 1.5 MiB |
Length
| Max length | 17 |
|---|---|
| Median length | 14 |
| Mean length | 8.3280885 |
| Min length | 1 |
Characters and Unicode
| Total characters | 830985 |
|---|---|
| Distinct characters | 36 |
| Distinct categories | 7 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 149 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | "=""47""" |
|---|---|
| 2nd row | "=""37""" |
| 3rd row | 0 |
| 4th row | "=""59""" |
| 5th row | "=""318/2""" |
| Value | Count | Frequency (%) |
| 0 | 10062 | 10.1% |
| 1 | 4204 | 4.2% |
| 2 | 2592 | 2.6% |
| 5 | 2095 | 2.1% |
| 3 | 2086 | 2.1% |
| 7 | 2051 | 2.1% |
| 10 | 1800 | 1.8% |
| 6 | 1630 | 1.6% |
| 4 | 1626 | 1.6% |
| 16 | 1477 | 1.5% |
| Other values (1061) | 70162 |
Most occurring characters
| Value | Count | Frequency (%) |
| " | 538314 | |
| = | 89719 | 10.8% |
| 1 | 41644 | 5.0% |
| 2 | 26384 | 3.2% |
| 0 | 19273 | 2.3% |
| 4 | 16493 | 2.0% |
| 3 | 16383 | 2.0% |
| 5 | 14112 | 1.7% |
| 6 | 13529 | 1.6% |
| А | 12735 | 1.5% |
| Other values (26) | 42399 | 5.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Other Punctuation | 542891 | |
| Decimal Number | 179762 | 21.6% |
| Math Symbol | 89719 | 10.8% |
| Uppercase Letter | 18552 | 2.2% |
| Dash Punctuation | 48 | < 0.1% |
| Connector Punctuation | 9 | < 0.1% |
| Space Separator | 4 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| А | 12735 | |
| Б | 3115 | 16.8% |
| Г | 966 | 5.2% |
| В | 923 | 5.0% |
| Д | 384 | 2.1% |
| Н | 203 | 1.1% |
| Е | 57 | 0.3% |
| М | 43 | 0.2% |
| Ж | 32 | 0.2% |
| С | 31 | 0.2% |
| Other values (9) | 63 | 0.3% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 41644 | |
| 2 | 26384 | |
| 0 | 19273 | |
| 4 | 16493 | 9.2% |
| 3 | 16383 | 9.1% |
| 5 | 14112 | 7.9% |
| 6 | 13529 | 7.5% |
| 7 | 12067 | 6.7% |
| 8 | 10999 | 6.1% |
| 9 | 8878 | 4.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| " | 538314 | |
| / | 4544 | 0.8% |
| \ | 33 | < 0.1% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 89719 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 48 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 9 |
Space Separator
| Value | Count | Frequency (%) |
| 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 812433 | |
| Cyrillic | 18552 | 2.2% |
Most frequent character per script
Cyrillic
| Value | Count | Frequency (%) |
| А | 12735 | |
| Б | 3115 | 16.8% |
| Г | 966 | 5.2% |
| В | 923 | 5.0% |
| Д | 384 | 2.1% |
| Н | 203 | 1.1% |
| Е | 57 | 0.3% |
| М | 43 | 0.2% |
| Ж | 32 | 0.2% |
| С | 31 | 0.2% |
| Other values (9) | 63 | 0.3% |
Common
| Value | Count | Frequency (%) |
| " | 538314 | |
| = | 89719 | 11.0% |
| 1 | 41644 | 5.1% |
| 2 | 26384 | 3.2% |
| 0 | 19273 | 2.4% |
| 4 | 16493 | 2.0% |
| 3 | 16383 | 2.0% |
| 5 | 14112 | 1.7% |
| 6 | 13529 | 1.7% |
| 7 | 12067 | 1.5% |
| Other values (7) | 24515 | 3.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 812433 | |
| Cyrillic | 18552 | 2.2% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| " | 538314 | |
| = | 89719 | 11.0% |
| 1 | 41644 | 5.1% |
| 2 | 26384 | 3.2% |
| 0 | 19273 | 2.4% |
| 4 | 16493 | 2.0% |
| 3 | 16383 | 2.0% |
| 5 | 14112 | 1.7% |
| 6 | 13529 | 1.7% |
| 7 | 12067 | 1.5% |
| Other values (7) | 24515 | 3.0% |
Cyrillic
| Value | Count | Frequency (%) |
| А | 12735 | |
| Б | 3115 | 16.8% |
| Г | 966 | 5.2% |
| В | 923 | 5.0% |
| Д | 384 | 2.1% |
| Н | 203 | 1.1% |
| Е | 57 | 0.3% |
| М | 43 | 0.2% |
| Ж | 32 | 0.2% |
| С | 31 | 0.2% |
| Other values (9) | 63 | 0.3% |
| receiptid | kkt_sn | amount | price | cost | nal | electron | avans | credit | vstrechpredst | Тип региона | Тип города | Тип улицы | Тип номера дома | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| receiptid | 1.000 | 0.020 | 0.000 | -0.005 | -0.004 | -0.006 | 0.010 | 0.002 | -0.005 | -0.009 | 0.021 | 0.016 | 0.024 | 0.028 |
| kkt_sn | 0.020 | 1.000 | -0.103 | -0.068 | -0.093 | -0.055 | 0.017 | 0.026 | -0.016 | 0.009 | 0.179 | 0.056 | 0.156 | 0.088 |
| amount | 0.000 | -0.103 | 1.000 | -0.219 | 0.038 | 0.074 | -0.059 | 0.009 | 0.008 | 0.014 | 0.087 | 0.000 | 0.089 | 0.000 |
| price | -0.005 | -0.068 | -0.219 | 1.000 | 0.950 | 0.326 | 0.241 | 0.013 | 0.009 | 0.009 | 0.400 | 0.381 | 0.162 | 0.195 |
| cost | -0.004 | -0.093 | 0.038 | 0.950 | 1.000 | 0.362 | 0.233 | 0.015 | 0.010 | 0.012 | 0.383 | 0.348 | 0.139 | 0.152 |
| nal | -0.006 | -0.055 | 0.074 | 0.326 | 0.362 | 1.000 | -0.725 | -0.031 | -0.017 | -0.012 | 0.210 | 0.235 | 0.139 | 0.000 |
| electron | 0.010 | 0.017 | -0.059 | 0.241 | 0.233 | -0.725 | 1.000 | -0.011 | -0.005 | -0.001 | 0.288 | 0.164 | 0.000 | 0.288 |
| avans | 0.002 | 0.026 | 0.009 | 0.013 | 0.015 | -0.031 | -0.011 | 1.000 | 0.110 | 0.140 | 0.000 | 0.000 | 0.000 | 0.000 |
| credit | -0.005 | -0.016 | 0.008 | 0.009 | 0.010 | -0.017 | -0.005 | 0.110 | 1.000 | 0.196 | 0.007 | 0.000 | 0.000 | 0.000 |
| vstrechpredst | -0.009 | 0.009 | 0.014 | 0.009 | 0.012 | -0.012 | -0.001 | 0.140 | 0.196 | 1.000 | 0.039 | 0.000 | 0.063 | 0.002 |
| Тип региона | 0.021 | 0.179 | 0.087 | 0.400 | 0.383 | 0.210 | 0.288 | 0.000 | 0.007 | 0.039 | 1.000 | 0.278 | 0.336 | 0.359 |
| Тип города | 0.016 | 0.056 | 0.000 | 0.381 | 0.348 | 0.235 | 0.164 | 0.000 | 0.000 | 0.000 | 0.278 | 1.000 | 0.166 | 0.127 |
| Тип улицы | 0.024 | 0.156 | 0.089 | 0.162 | 0.139 | 0.139 | 0.000 | 0.000 | 0.000 | 0.063 | 0.336 | 0.166 | 1.000 | 0.396 |
| Тип номера дома | 0.028 | 0.088 | 0.000 | 0.195 | 0.152 | 0.000 | 0.288 | 0.000 | 0.000 | 0.002 | 0.359 | 0.127 | 0.396 | 1.000 |
| receiptid | kkt_sn | d_date | name | amount | unit | price | cost | nal | electron | avans | credit | vstrechpredst | Тип региона | Имя региона | Тип города | Имя города | Тип улицы | Имя улицы | Тип номера дома | Номер дома | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 6857952 | 78257434 | 108205053817 | 2018-07-23 | 0001 СТРОИТЕЛЬНЫЕ И ОТДЕЛОЧНЫЕ МАТЕРИАЛЫ | 1.0 | -- | 1947.5 | 1947.5 | 0.0 | 1947.5 | 0.0 | 0.0 | 0.0 | обл | Курганская | г | Курган | ул | Коли Мяготина | дом | "=""47""" |
| 9633993 | 76433658 | 108201822881 | 2018-07-18 | 0001 ТОВАР | 1.0 | -- | 25.0 | 25.0 | 25.0 | 0.0 | 0.0 | 0.0 | 0.0 | обл | Тверская | г | Торжок | ул | Луначарского | дом | "=""37""" |
| 2780605 | 81606801 | 108206691300 | 2018-07-31 | 0001 ЭЛЕКТРОЭНЕРГИЯ | 1.0 | -- | 272.0 | 272.0 | 272.0 | 0.0 | 0.0 | 0.0 | 0.0 | Респ | Кабардино-Балкарская | г | Баксан | пр-кт | Ленина | 0 | 0 |
| 8039130 | 70954824 | 108206440469 | 2018-07-01 | 0001 ТОВАР | 1.0 | штука | 337.0 | 337.0 | 337.0 | 0.0 | 0.0 | 0.0 | 0.0 | обл | Иркутская | г | Бодайбо | ул | Карла Либкнехта | дом | "=""59""" |
| 8406613 | 71787576 | 108202503230 | 2018-07-05 | 0001 ТОВАР ПО СВОБОДНОЙ ЦЕНЕ | 1.0 | -- | 120.0 | 120.0 | 120.0 | 0.0 | 0.0 | 0.0 | 0.0 | край | Ставропольский | г | Ставрополь | ул | Ленина | дом | "=""318/2""" |
| 2024067 | 70501518 | 108202271399 | 2018-06-30 | 0001 БАКАЛЕЙНАЯ ПРОДУКЦИЯ | 1.0 | -- | 134.0 | 134.0 | 0.0 | 1090.04 | 0.0 | 0.0 | 0.0 | Респ | Татарстан | г | Нижнекамск | ул | Баки Урманче | дом | "=""2А""" |
| 2936578 | 71484714 | 108203422422 | 2018-07-04 | 0322 ПЕЛЬМЕНИ "ПЕТУШОК" /0,8/ П/Э (ЗАМ) | 1.0 | килограмм | 125.0 | 125.0 | 1034.91 | 0.0 | 0.0 | 0.0 | 0.0 | обл | Омская | г | Омск | ул | Труда | дом | "=""21""" |
| 9620217 | 73258498 | 108208504672 | 2018-07-10 | 0001 ТОВАРЫ БАКАЛЕЙНОЙ ГРУППЫ | 1.0 | -- | 50.0 | 50.0 | 50.0 | 0.0 | 0.0 | 0.0 | 0.0 | Респ | Чувашская (Чувашия) | г | Новочебоксарск | ул | 10 Пятилетки | дом | "=""12""" |
| 3005126 | 60576185 | 108202264290 | 2018-06-05 | 0001 ТОВАР | 1.0 | -- | 62.0 | 62.0 | 62.0 | 0.0 | 0.0 | 0.0 | 0.0 | обл | Костромская | 0 | 0 | ул | Костромская | дом | "=""2Б""" |
| 9734165 | 80426274 | 108207305561 | 2018-07-29 | 0001 Яйцо | 1.0 | руб. | 156.0 | 156.0 | 156.0 | 0.0 | 0.0 | 0.0 | 0.0 | обл | Липецкая | г | Елец | ул | Овражная | дом | "=""39""" |
| receiptid | kkt_sn | d_date | name | amount | unit | price | cost | nal | electron | avans | credit | vstrechpredst | Тип региона | Имя региона | Тип города | Имя города | Тип улицы | Имя улицы | Тип номера дома | Номер дома | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 4930566 | 77311139 | 108201016310 | 2018-07-21 | 0001 Продукт | 1.0 | -- | 1850.0 | 1850.0 | 1850.0 | 0.0 | 0.0 | 0.0 | 0.0 | Респ | Саха /Якутия/ | г | Ленск | ул | Нюйская | дом | "=""58""" |
| 2032747 | 68364988 | 108202271399 | 2018-06-25 | 0001 БАКАЛЕЙНАЯ ПРОДУКЦИЯ | 1.0 | -- | 81.0 | 81.0 | 0.0 | 433.41 | 0.0 | 0.0 | 0.0 | Респ | Татарстан | г | Нижнекамск | ул | Баки Урманче | дом | "=""2А""" |
| 5568974 | 67718777 | 108200438782 | 2018-06-24 | 0003 ПОСТЕЛЬНОЕ БЕЛЬЕ | 1.0 | -- | 190.0 | 190.0 | 0.0 | 980.0 | 0.0 | 0.0 | 0.0 | обл | Новгородская | г | Боровичи | ул | Коммунарная | дом | "=""51""" |
| 9183434 | 65868830 | 108201345299 | 2018-06-09 | 0001 ОХЛАЖДЕННОЕ МЯСО | 1.0 | килограмм | 618.0 | 618.0 | 0.0 | 618.0 | 0.0 | 0.0 | 0.0 | г | Санкт-Петербург | 0 | 0 | ул | Наличная | дом | "=""42""" |
| 2226299 | 73621956 | 108206075594 | 2018-07-11 | 0001 ТОВАР | 1.0 | -- | 102.0 | 102.0 | 102.0 | 0.0 | 0.0 | 0.0 | 0.0 | обл | Кемеровская | г | Новокузнецк | ул | Кутузова | дом | "=""5""" |
| 4747959 | 62990917 | 108202436268 | 2018-06-13 | 0002 СОПУТСТВУЮЩИЕ ТОВАРЫ | 10.0 | штука | 2.0 | 20.0 | 2030.0 | 0.0 | 0.0 | 0.0 | 0.0 | обл | Кемеровская | г | Новокузнецк | ул | Ленина | дом | "=""72/1""" |
| 9807248 | 69881302 | 108206234224 | 2018-06-29 | 0001 ЭЛЕКТРОЭНЕРГИЯ | 1.0 | -- | 1500.0 | 1500.0 | 1500.0 | 0.0 | 0.0 | 0.0 | 0.0 | Респ | Дагестан | г | Избербаш | ул | Буйнакского | дом | "=""197""" |
| 116464 | 66403225 | 108207865154 | 2018-06-21 | 0001 ТОВАР НА СУММУ | 1.0 | -- | 55.0 | 55.0 | 0.0 | 55.0 | 0.0 | 0.0 | 0.0 | обл | Оренбургская | 0 | 0 | ул | Центральная | дом | "=""3""" |
| 6974799 | 66916679 | 108407540550 | 2018-06-22 | 0006 ХЛЕБ | 1.0 | -- | 18.0 | 18.0 | 91.0 | 0.0 | 0.0 | 0.0 | 0.0 | обл | Свердловская | г | Нижний Тагил | ул | Захарова | дом | "=""1А""" |
| 775923 | 62250984 | 108208798408 | 2018-06-10 | 0001 ТОВАР НА СУММУ | 1.0 | -- | 39.0 | 39.0 | 39.0 | 0.0 | 0.0 | 0.0 | 0.0 | обл | Оренбургская | г | Оренбург | ул | Конституции СССР | дом | "=""20""" |
Most frequently occurring
| receiptid | kkt_sn | d_date | name | amount | unit | price | cost | nal | electron | avans | credit | vstrechpredst | Тип региона | Имя региона | Тип города | Имя города | Тип улицы | Имя улицы | Тип номера дома | Номер дома | # duplicates | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 3 | 62692882 | 108209650027 | 2018-06-06 | 0001 ТОВАР | 1.0 | -- | 250.0 | 250.0 | 7526.0 | 0.0 | 0.0 | 0.0 | 0.0 | Респ | Башкортостан | г | Стерлитамак | ул | Суханова | дом | "=""4""" | 3 |
| 9 | 66248040 | 108201798101 | 2018-06-20 | 0004 4 | 1.0 | -- | 100.0 | 100.0 | 15995.0 | 0.0 | 0.0 | 0.0 | 0.0 | обл | Воронежская | г | Воронеж | проезд | Монтажный | дом | "=""2""" | 3 |
| 0 | 60114715 | 108201782774 | 2018-06-04 | 0004 ТОВАР 4 | 1.0 | -- | 100.0 | 100.0 | 424.0 | 0.0 | 0.0 | 0.0 | 0.0 | обл | Воронежская | г | Воронеж | пр-кт | Московский | дом | "=""131Б""" | 2 |
| 1 | 61191640 | 108406418542 | 2018-06-07 | 0101 Самса 130г. | 1.0 | шт | 34.0 | 34.0 | 126.0 | 0.0 | 0.0 | 0.0 | 0.0 | обл | Липецкая | г | Липецк | ул | З.Космодемьянской | дом | "=""2Б""" | 2 |
| 2 | 61411634 | 108207359837 | 2018-06-08 | 0008 Хлеб бородинский | 1.0 | -- | 35.0 | 35.0 | 105.0 | 0.0 | 0.0 | 0.0 | 0.0 | обл | Новосибирская | г | Новосибирск | ул | Петухова | дом | "=""69""" | 2 |
| 4 | 62980098 | 108205647723 | 2018-06-12 | 0003 Хлеб на сыворотке 0,3 гр | 1.0 | -- | 12.0 | 12.0 | 120.0 | 0.0 | 0.0 | 0.0 | 0.0 | обл | Новосибирская | г | Новосибирск | ул | Большевистская | дом | "=""131/2""" | 2 |
| 5 | 63357438 | 108203694789 | 2018-06-09 | 0008 БИЛЕТ ВЗРОСЛЫЙ | 1.0 | -- | 120.0 | 120.0 | 360.0 | 0.0 | 0.0 | 0.0 | 0.0 | Респ | Чувашская (Чувашия) | г | Чебоксары | ул | Космонавта Николаева А.Г. | дом | "=""6""" | 2 |
| 6 | 63477898 | 108207707384 | 2018-06-13 | 0011 ГАЗЕТА | 1.0 | -- | 9.0 | 9.0 | 486.0 | 0.0 | 0.0 | 0.0 | 0.0 | обл | Челябинская | г | Миасс | пр-кт | Автозаводцев | дом | "=""36""" | 2 |
| 7 | 65203807 | 108401779671 | 2018-06-18 | 0136 С-т Крабовый | 1.0 | -- | 40.0 | 40.0 | 0.0 | 360.0 | 0.0 | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| 8 | 65645217 | 108205304022 | 2018-06-19 | 0002 ПЛОВ БОЛЬШОЙ | 1.0 | штука | 140.0 | 140.0 | 1480.0 | 0.0 | 0.0 | 0.0 | 0.0 | Респ | Крым | г | Симферополь | ул | Пушкина | дом | "=""46""" | 2 |